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The cirrhosis and cirrhosis-related problems are connected to the degree of 
fibrosis in the liver. The purpose of this paper is to propose an automated 
method for identifying liver fibrosis using ultrasound shear wave elastography 
(700) images that is based on a hybrid machine learning approach using a 
convolutional neural network (CNN) with two types of classifier (SoftMax 
and support vector machine (SVM)). The dataset gathered from hospitals is 
used in the training and testing phases of the model. The objective is to 
develop a hybrid machine learning model that can classify images based on 
their stage of fibrosis. The suggested system comprises three stages. The first 
is the preprocessing step, which starts with countor detection and continues 
with the "contrast limited adaptive histogram equalization (CLAHE)" 
technique to show the properties of liver tissue. In the second step, the CNN 
algorithm was utilized, which was based on several images to extract deep 


features and identify shear wave elastography (SWE) samples. In the third 
step, SVM and SoftMax functions are used to classify liver fibrosis. A five- 
class model (normal, F1, F2, F3, and F4) was developed. The result illustrates 
how successfully the CNN-SoftMax and CNN-SVM classifiers classified 
liver fibrosis in the test dataset, with 97.18% and 98.59% accuracy, 
respectively. 


This is an open access article under the CC _BY-SA license. 


Corresponding Author: 


Zainab Sattar Jabbar 

Department of Biomedical Engineering, College of Engineering, Al-Nahrain University 
Al Jadriyah Bridge, Baghdad, Iraq 

Email: st.zainab.s.jabbar @ced.nahrainuniv.edu.iq 


1. INTRODUCTION 

Liver fibrosis develops as a result of structural or functional changes in the liver induced by viruses, 
immunological responses, and toxic assaults [1], [2]. The severity of liver fibrosis is linked to the risk of 
developing cirrhosis and cirrhosis-related complications [3]. The standard method for assessing liver fibrosis 
is currently a liver biopsy. However, it does have some drawbacks. It is an invasive procedure, with severe 
complications occurring in up to 1% of biopsy cases [4], [5]. Because biopsies only make up a small fraction 
of the overall liver, there is bound to be some sampling error [6]. 

As a result, noninvasive methods for detecting liver fibrosis, such as transient elastography, shear 
wave elastography, and magnetic resonance elastography, are becoming more popular [7]-[9]. The use of 
ultrasound shear wave elastography to identify advanced liver fibrosis has shown benefits as a non-invasive 
and low-cost approach. Ultrasound is used to create and measure tissue shear waves, which are then utilized to 
determine tissue stiffness. To determine tissue stiffness and fibrosis staging, a technician must manually select 
tissue for shear wave elastography (SWE) excitation using ultrasound imaging guidance, then measure tissue 
stiffness over a manually specified field of interest (ROI) within the stimulated tissue and also average tissue 
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stiffness from several sources (3—10) independently acquired images. The current process's manual nature 
increases inter-operator variability and output quality, with blood arteries, bile ducts, and the liver capsule 
frequently being included in the ROI by mistake, resulting in unusually high shear wave speeds for the beneath 
liver parenchyma [10]. 

Lee et al. [11] in 2020, developed a four model (FO vs. FI vs. F23 vs. F4). A total of 13,608 
ultrasonography images from 3446 patients were used to train a diffusion-convolutional neural networks 
(DCNN). The accuracy of the four-class model was 76.4%. Dandan et al. [12] in 2019 proposed a strategy for 
classifying diffuse liver disorders based on multimodal aspects in ultrasound images. The convolutional neural 
network (CNN) was used to extract image structure characteristics, and the multi-scale gray-level co-occurrence 
matrix (MGLCM) and wavelet multi-sub-bands co-occurrence matrix (WMCOM) were used to extract image 
texture features WMCM. These two types of characteristics were combined to form multimodal features, which 
were then fed into a light gradient boosting machine (GBM) classifier as input. The proposed method was used 
to classify 2942 liver ultrasound images. Normal, fatty liver disease, and liver fibrosis classification accuracies 
are 82.1%, 85.0%, and 80.9%, respectively. Gatos et al. [13] in 2017 proposed a computer aided diagnosis system 
that uses to classify chronic liver disease from ultrasound shear wave elastography images. To quantify 85 
pictures, the suggested algorithm uses an inverse mapping technique. The support vector machines (SVM) model 
had the highest classification accuracy of 87.0%. Deep learning technology was used to diagnose cirrhosis by Liu 
et al. [14]. CNN was used by Liu et al. [14] in 2017 in their study to create characteristics from ultrasound pictures. 
Liu et al. [14] used SVM as a classifier to discriminate between normal and abnormal livers, and the proposed 
method's accuracy was 96%. Kagadis et al. [15] in 2020 compared certain popular deep learning pre-trained 
networks with and without augmentation, using temporally stable and complete elastograms, and proposed 
acceptable deep learning strategies for chronic liver disease diagnosis and progression assessment. A total of 200 
chronic liver disease patients with a verified liver biopsy were evaluated using the conventional ultrasound 
(US)/SWE method. Four pictures of the same part of the liver were saved and analyzed in order to obtain 
elastograms and rule out temporally unstable areas. Googlnet, Alexnet, Vgg16, Resnet50, and Densenet201, with 
and without augmentation, were fed complete and temporally stable masked elastograms for each patient. The 
networks were tested for recognizing chronic liver disease stages in seven classification schemes over 30 
repetitions. The highest mean accuracies were found in all networks, ranging from 87.2 % to 97.4 %. Table 1 
shows a comparison of existing deep learning-based liver fibrosis diagnosis approaches with the performance of 
the proposed system on various imaging datasets. It should be noted that the proposed system outperformed the 
other existing systems in terms of performance. 


Table 1. Other methods are compared to the proposed method 


Study Method Imaging Accuracy % 
Lee et al. [11] DNN US 16% 
Dandan et al. [12] CNN US 80.9% 
Gatos et al. [13] SVM US SWE 87.0 
Gatos et al. [16] CNN US SWE 95% 
Liu et al. [14] SVM US SWE 96% 
Kagadis et al. [15] Resnet50 US SWE 97.4% 
Our model CNN-SVM US SWE 98.59 


The purpose of this paper was to use (US-SWE) images from healthy patients and patients with liver 
fibrosis to train a CNN algorithm to distinguish between healthy and fibrotic livers and classify the stages of 
fibrosis (multi-classification). The main contribution of the proposed system is the following: The proposed 
system focused on creating a machine learning-based hybrid model (CNN-SVM) to diagnose liver fibrosis in 
SWE images, adopting fewer parameters than the most modern deep learning models SVM, the proposed 
method is highly accurate, employing (CNN-SoftMax) or (CNN-SVM), which alleviates the difficulties 
associated with handling more fibrosis cases. 


2. METHOD 
2.1. Dataset gathering 

We have collected our data from a LOGIQ E9 ultrasound shear wave elastography imaging machine. 
The dataset consists of 700 US/SWE images (150 normal liver SWE images and F=150, F2=150, F3=114, 
F4=136). The datasets are split into two samples: for the training phase (90%) and the testing phase (10%). In 
order to establish a fixed size for all images in the dataset to be appropriate for further processing, the SWE 
images are scaled to a size of (256x256) pixels. 
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2.2. Dataset preparation 

We categorized our data before preprocessing. We've labeled the dataset as multistage because we're 
employing multi-classification (normal, F1, F2, F3, and F4). Normal tissue is healthy (i.e., there is no fibrosis), 
whereas other stages are fibrosis tissue. Stages F1 and F2 are mild fibrosis; stage F3 is moderate fibrosis, and 
stage F4 is severe fibrosis (cirrhosis). The images collected from the US/SWE were in digital imaging and 
communications in medicine (DICOM) format. The DICOM images contained metadata containing patient 
information. As a result, all metadata is lost when images are converted into bitmap format. Furthermore, 
because computer vision processing algorithms deal with images as raw pixel values, it's easier to convert 
images to the (.bmp) format before using them as input to the CNN model to save computational power. 


2.3. Data pre-processing 

Our data pre-processing consists of many parts shown in Figure 1. First step is Image cropping to a 
large texture ROI that only contains liver pixels as shown in Figure 2. To crop the images, we utilized the 
(finding extreme point in contour) technique using opencv in Python. Second, the images are three-channel 
colorful images red, green, and blue (RGB), so they are converted to grayscale images in order to reduce 
processing time (one channel). Third step is image enhancement. SWE image preprocessing is performed using 
the contrast limited adaptive histogram equalization (CLAHE) approach in this step. CLAHE is a contrast 
enhancement approach that efficiently improves image contrast, reduces noise and blurring, sharpens features, 
and displays image details like as edge detection without affecting the SWE image's inherent structure. Forth 
step is image resizing. The time it takes to train a neural network model is cut in half when images are resized. 
Using python and opencv, we resized the images. Finally, normalization: To speed up the processing, divide 
the image elements by 255 to create the values between 0 and 1. 


Original image Cropping Gray scale Enhancement 


see 


Normalization } : ( Resinase-20)| 


Figure 1. The pre-processing steps 


Step 4. Crop the image 


Step 1. Get the original image Step 2. Find the biggest contour Step 3. Find the extreme points 


Figure 2. Pre-processing process (cropping by contour detection algorithm) 


2.4. Proposed model 
The design of the CNN architecture illustrated in Figure 3.To detect and analyze available features 
in SWE images, a CNN with several convolution layers and a MaxPooling layer was used in this study. The 
following is the structure of the feature extraction: 
— The input layer is the first layer that contain the image of ultrasound (SWE image). 
— The convolutional layer: Conv2D was used in our proposed CNN model. In our model, we employed five 
conv2D layers. 
— The pooling layer: MaxPooling2D was utilized in this model. We utilized a MaxPool2D layer for each 
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Conv2D layer. As a result, we used five MaxPool2D layers. 

— The flatten layer: we used a flatten layer after using the pooling layer to flat the network system in our 
model. 

— The dense layer: we utilized dense layers after the flatten layer. 

— The activation function: we used the SoftMax function from (1) with the other dense layer and the rectified 
linear unit (ReLU) function from (2) [17]. 


ei 
a(Z) = aed (1) 
ReLU = max(0,x) (2) 
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Figure 3. Block diagram of the proposed system 
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The weights were modified with a categorical cross-entropy loss function and the optimizer is Adam 
with a learning rate of (0.001). The proposed model is summarized in Table 2. ReLu is an activation function 
for each convolutional layer. The pooling layer is a technique for down sampling a collection of nearby pixels 
into a single pixel. This down sampling reduced the size of the activation maps image from 4x4 to 2x2. Over- 
fitting can be solved by pooling layers, and MaxPool has proven to be the most effective [18]. 


Table 2 Exhibit the summary of CNN model 


No. Layer name Input layer Outputlayer _ Filter size Parameter 
1 Convolution 1 & ReLU (256, 256, 1) (256, 256, 20) (3x3), 1 560 
2 MaxPooling! & Dropout (0.1) (256, 256, 20) (128, 128, 20) (2x2), 2 0 
3 Convolution 2 & ReLU (128, 128, 20) (128, 128, 40) (3x3), 1 7240 
4 MaxPooling2 & Dropout (0.1) (128, 128, 40) (64, 64, 40) (2x2), 2 0 
5 Convolution 3 & ReLU (64, 64, 40) (64, 64, 80) (3x3), 1 28880 
6 MaxPooling3 &Dropout (0.1) (64, 64, 80) (32, 32, 80) (2x2), 2 0 
7 Convolution 4 & ReLU (32, 32, 80) (32, 32, 160) (3x3), 1 115360 
8 MaxPooling4 & Dropout (0.1) (32, 32, 16+0) (16, 16, 160) (2x2), 2 0 
9 Convolution 5 & ReLU (16, 16, 160) (16, 16, 320) (3x3), 1 461120 
10 MaxPoolingS & Dropout (0.1) (16, 16, 320) (8, 8, 320) (2x2), 2 0 
11 Fully connected layer (Flatten) (8, 8, 320) 20480 / 0 
12 Dense layer 1 (512) 20480 512 / 10486272 
13 Dropout layer (0.4) / / / 0 
14 Dense layer 2 (5) SoftMax 512 5 / 2565 
11,101,637 


2.5. Classification phase 

Features of SWE images are used to appropriately categorize patients whether they are infected or not 
for the diagnosis of liver fibrosis infected cases. Classification is utilized to introduce diagnosis prediction 
(normal, fibrosis stages (F1, F2, F3, F4)) based on the extracted attribute. This research employs two classifiers 
(SoftMax and SVM). 


2.5.1. SoftMax classifier 

After the flatten layer, the dense architecture has one fully connected layer with an activation function 
(ReLu). The activation function for the dropout layer with a ratio of (0.4) and the dense layer for classification 
was SoftMax. Since SoftMax values are in between 0 and 1, and if all of the features are performed using the 
formula as (3), the result is 1. 


softmax (xi) = exp (xi) \ =o exp (xj) (3) 


X i is the production associated with class I, where j is the number of classes. The SoftMax function is 
extensively used to estimate probabilities and conduct multi-classifications [19]. 


2.5.2. Support vector machine classifier 

SVM is the second classification method. To provide results for identifying SWE images, the SVM 
classifier was added to the retrieved feature of the fully connected layer of CNN. The standardization of the 
feature values obtained (feature vector) from CNN, which speeds up processing because these characteristics 
will be within a specified range after standardization. SWE image classification is split into two stages. Using 
the trained CNN model, the features of the input data are automatically extracted during the SVM's training 
phase. To complete SVM training, the retrieved representative feature data and their related labels are fed into 
the model [20]. 


2.6. Proposed model's performance 
Precision, recall, f1 score, and accuracy were used to assess the performance of our suggested model. 
The following are the accuracy, fl-score, and precision equations: 


Accuracy equation [21]-[24] = TP + = +TN+P+N (4) 
Sensitivity equation [21]-[23]=TP+TN/TP+N (5) 
Precision equation [21]-[23] =TP + TN/TP + P (6) 
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Fl-score equation [21 ]-[23], [25] =(PrecisionxSensitivity) / (Precision+Sensitivity) (7) 


where: TP=correct positive, TN=true negative, P=positive condition, N=negative condition. 

Figure 4 displays the proposed model's main categorization criteria. The following are the parameters 
of the given model and test SWE images: true positive is a correct classification of a positive prognosis, such 
as normal diagnosed as normal, false positive is an incorrect classification of a positive diagnosis, such as 
fibrosis liver diagnosed as normal, true negative is a correct classification of a negative diagnosis, such as 
fibrosis liver diagnosed as fibrosis liver, and false negative is an incorrect classification of a negative diagnosis, 
such as normal diagnosed as fibrosis liver. 


3. RESULTS AND DISCUSSION 
3.1. Requirements for software and hardware 

The proposed liver fibrosis classification system is based on a Lenovo personal computer with 
specifications that include NVidia GTX 1650 4GB, AMD Ryzen5 5600H CPU, 512GB SSD storage, and 16GB 
RAM. TensorFlow open-source framework (a Google open-source programming library that is concentrated 
on supporting tensors to produce successful work), Keras (a neural network library based on TensorFlow 
written in Python, and also open-source), and Python (Python version 3.9.7, Jupyter notebook) are used for the 
implementation of the CNN code. The system relied on open source libraries such as Open, Scikit Learn, and 
Pandas. These libraries are dedicated to dealing with machine learning and data analysis. 


3.2. Results of the features extraction steps 

The feature extraction steps are carried out using CNN, which is made up of five successful 
convolutional blocks, each of which has four layers: a convolutional layer with a rectified linear unit (ReLU) 
activation function, a MaxPooling layer, and a dropout layer. The CNN model's convolution layer extracts 
features or manifestations of SWE images by passing many different filters over a SWE image, such as edge 
detection, vertical, horizontal, and diagonal lines, curves, corners, color contrast sensitive filters, and so on. 
Each of these filters generates a feature map, which shows the spatial patterns where the feature is located. The 
feature map is then passed through a non-linear activation function ReLU. Each filter has a size of (3,3), and 
the number of filters in each layer is (20, 40, 80, 160, 320). The MaxPooling layer reduces computational 
power by reducing the dimensions of features maps to half because the kernel size is equal to (2, 2). This 
contributes to reducing the over-fitting problem. 


3.3. Result of shear wave elastography image classification using SoftMax function 

The extracted features are passed to the layers for classification after the feature extraction stage. The 
confusion matrix of features extracted by CNN is shown in Figure 4. The confusion matrix of the test sample 
yielded the highest accuracy of 97.18%, sensitivity of 97.2%, precision of 97.2%, and an F1 score of 97.4%. 
To highlight the high outcomes of our suggested model, as shown Figure (5) we plotted model accuracy in 
Figure 5(a) and model loss in Figure 5(b). As illustrated in these graphs, our proposed CNN model is neither 
under-fit nor over-fit. This explains why, in terms of precision, recall, and fl-score, we surpassed the 
competition. 
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Figure 4. Confusion matrix of the test sample for SoftMax classifier 
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Figure 5. To highlight the high outcomes of our suggested model; (a) model accuracy graph and (b) model 
loss graph 


3.4. Result of shear wave elastography image classification using support vector machine 

The features from the dense layer 1 of the CNN will be classified using "linear" as the kernel function 
in the support vector machine. The accuracy of the SVM classifier is 98.59%. The confusion matrix using the 
SVM classifier when the model is trained on a dataset. The confusion matrix of the test sample for the SVM 
classifier is shown in Figure 6. 
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Figure 6. Confusion matrix of the test sample for SVM classifier 


3.5. Comparison results between CNN-SoftMax and CNN-SVM 

Table 3 compares the results of performance measures using CNN-SoftMax to the results of 
performance measures using CNN-SVM, showing that CNN-SVM outperformed CNN- SoftMax. CNN have 
convolutional layers, which contain important filters for extracting features from the SWE image and have thus 
been used to extract features. The SVM is built on the principle of finding the optimal hyper plane for separating 
a data set into two groups in an efficient way. To maximize the results of the margin (the distance between the 
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hyper plane and the closest point of any of the data sets), the SVM provides the ability of generalization better 
compared to using SoftMax function as a classifier. 


Table 3. Comparison results between CNN-SoftMax and CNN-SVM 


Method Sensitivity Precision _Fl_score Accuracy 
CNN-SoftMax 97.2 97.2 97.4 97.18 
CNN-SVM 98.6 98.6 98.8 98.5 


4. CONCLUSION 

The proposed system addresses the issue of detecting liver fibrosis in US/SWE liver images. The 
preprocessing phase is essential for working efficiently with CNN and producing higher-quality results. On 
SWE images, the proposed method was evaluated using the CNN-SoftMax function and CNN-SVM classifiers; 
accuracy was 98.5%with CNN-SVM and 97.18% with CNN-SoftMax, respectively. This study compared the 
CNN-SoftMax and CNN-SVM algorithms, and the experimental results show that the CNN-SVM algorithm 
performs better than the CNN-SoftMax algorithm. The proposed method, which employs CNN-Softmax or 
CNN-SVM, is highly accurate, reducing the problems associated with dealing with more liver fibrosis. The 
model achieved higher accuracy with fewer parameters (11,101,637) and less computational power. VGG16 
is trained and tested using the same dataset. Additionally, the proposed system outperformed it. Finally, in 
future work, we will expand the capabilities of the proposed system by using a large dataset of US/SWE images 
(thousands of images). 
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